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Abstract 



Aims. In this paper we present a classification of emission-line galaxies at intermediate and high redshifts (0.5 < z < 1.0 for optical 
spectra, z > 2.5 for near-infrared spectra), using the Z)„(4000) index as a supplementary diagnostic. Our goal is to complement the 
diagnostic based only on emission-line ratios from the blue part of the spectra, which suffer from some limitations for the classifica- 
tion of Seyfert 2 and composite galaxies. 

Methods. We used a sample of 89 379 galaxies with a good signal-to-noise ratio from the Sloan Digital Sky Survey (data re- 
lease 7). Using the classification scheme presented in Paper I, we classified these galaxies with a diagnostic diagram involving the 
[Olll]A5007/Hj3 and [Oll]AA3726 + 3729/Hj3 emission-line ratios. Then we derived a supplementary diagnostic involving Z)„(4000) 
to improve this classification, in the regions where objects of different types are mixed. To show the validity of our spectral clas- 
sification we established success-rate and contamination charts, then we compared our results to those obtained with the reference 
classification that was scheme obtained also using Ha, [Nll]A6584, and [Sll]AA6717-i-6731 emission lines. 

Results. We show that our supplementary classification based on the D„(4000) index allows to separate unambiguously star-forming 
galaxies from Seyfert 2 in the region where they were mixed in Paper I. It also significantly reduces the region where star- forming 
galaxies are mixed with composites. 

Key words. Galaxies: active ; Galaxies: high-redshift ; Galaxies: Seyfert ; Galaxies: fundamental parameters 



1. Introduction 

There are several existing types of emission-line galaxies: the 
two main classes are star-forming galaxies (hereafter SFG) and 
active galactic nuclei (hereafter AGN). Emission lines are ob- 
served in star-forming galaxies because gas is ionized by new hot 
stars. In contrast, AGN galaxies contain a supermassive black- 
hole, and their emission lines come from gas ionization by the 
r* ] light emitted from their accretion disk. AGN can be classified 
, 1^ . in several types, but we only consider narrow-line AGNs, which 
K> ■ can be confused with SFG, i.e. Seyfert 2 galaxies and LINERs 
^ (low-ionization nuclear emission-line region). We do not con- 
d sider Seyfert 1 galaxies because they can be easily distinguished 
from SFGs by their wide Balmer emission lines. A third class of 
emission-line galaxies is what we call "composites". Composites 
show emission lines which are due both to recent star formation 
and to an AGN. 

To classify emission-line galaxies, one may use two diag- 
nostic diagrams depending on the redshift range: the first one is 
kn own as the BPT dia gnostic ( Baldwin et al.lll98 1), later studied 
bv lKewlev et al.l 12001) who used it to separate AGN fr om SFG 
thanks to theoretical models. iKauffmann et al.l (l2003ah revised 
Kewley's work and allowed going deeper into the classification 
process by showing a third type of galaxies calle d composites. It 
was then again revised by iKewley et al.l (l2006h . who improved 
the classification of AGNs into Seyfert 2 and LINERs. This 



diagnostic uses log([Oiii]A5007/Hj3) vs. log([Nii]A6583/Ha) 
and log([Oiii] A5007/HJ3) vs. log([Sii]AA6717 + 6731/Ha) di- 
agrams and may be used up to z < 0.5 with optical spectro- 
graphs. Other d iagnostics have been used in the past in the 
same diagrams (lHeckman[J l980': 'V eilleux & Qsterbrockl Il987l 
iHo et al.l Il997h . We use |Kewley et alJ (l2006h as a reference 
since it is the latest widely used diagno stic and is based o n 
the biggest sample. We refer the reader to iKewley et al.l (l2006l) . 
IConstantin & Vogeleyl (l2006l) . and re ferences herein for co mpar- 
isons of these diagnostics. See also iGroves et"an (l2006l) for a 
specific discussion on low-metallicity AGNs. 

The se c ond diagnostic was or iginally propose d by 
iTresse et al.1 (Il996h and studied later bv' Rola et all (Il997b . This 
diagnostic is useful at intermediate and high redshift when some 
emission lines used in the BPT diagnostic are n o longer observed 
by ge tting red- shifted out of spectrographs. iLamareille et al.l 
(|2004 hereafter L04) established a classification using em- 
pirical demarcation lines in the diagnostic diagram showing 
log([Oiii]A5007/Hj3) vs. log([Oii]AA3726 + 3729/Hj3), 
which may be used up to z < 1.0 with optical spectrographs, or 
even at z ^ 2.5 with near-infrared spect rographs (where op tical 
diagnostics cannot be used). In Paper I (iLamarei ll3 l20TQli . one 
of us proposed revised equations f or the classification that we 
use in this paper. We know that the ILama reille ( 2010, hereafter 
LIO) diagnostic implies a loss of Seyfert 2 galaxies, because of 
the region where Seyfert 2 and SFGs get mixed. As discussed in 
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Paper I, the LIO diagnostic also cannot unambiguously separate 
composites from SFGs or LINERs. The goal of this paper is 
to try to solve these two limitations with a different approach. 
Following the idea un der the "DEW" diagnostic introduced by 
IStasinska et alJ (I2QQ6 ), we us e the Z)^( 4 000) i ndex to derive 
a supplementary diagnostic. lYan et al.l (l2Qlll) have already 
derived a similar new diagnostic based on U — B rest-frame 
colors. Compared to the present paper, it does suffer from the 
following limitations. It is based on rest- frame colors whose 
calculation may suffer from biases from imperfect ^-correction 
at high redshift (unless such colors are integrated directly from 
the spectra), it does not provide a distinction between Seyfert 
2 galaxies and LINERs, and it does not provide a way to 
isolate at least a fraction of composite galaxies. Conversely, that 
diagnostic has the advantage of only relying on the detection of 
[Oiii] A5007 and Hj3 emission lines. 

Our goal is to provide a diagnostic that can be used 
to classify intermediate- or high-redshift emission-line galax- 
ies as closely as possible to local universe studies. The 
older L04 diagnostic has already been used in various stud- 
ies, such as star formation r ates (Maier et al. 20091 metal- 
licities (Mouhci ne etaP l2006l: iLaniareille et al] l2006ah . AGN 
populations (Bon giomo et al.l 1201 0|). gamma ray bu rst hosts 
(ISavaglio et ali20 09). and clusters (iLoubser et al.l2009h . Results 
provided in Paper I and here may be used to revise spectral 
classification of emission-line galaxies in interme diate redshif t 
optica l galaxy redshift surveys such as VVDS (|Le Fevre et al.l 



l2005l:[Garim et ani2008h . zCQ SMOS £illv et al.ll2009h . DEEP2 
([Davis et al.1 l2003h GDDS (lAbraham et al.1 l2004l) . GOODS 
("Balestra et al and others. We hope it will also serve as a 

reference for ongoing or future high-redshift surveys invo lving 
future spectrographs: in the optical, MUSE on VL T (Baco n et al.l 
1201 Q) or DIORAMAS on FELT (iLe Fevre et al. 2 010,); or in 
near-infrared (at z ^ , 2.5), EMIR on GTC (Garzon et al 
IContini et al.1 [20051), KMOS on VLT (fsharples et^all 
MOSFIRE on Keck ([McLean et all2008h . 
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It i s worth mentioning here as a warning that [Stasinska et al.[ 
(l2QQ8[) demonstrate that a fraction - whose value is still un- 
certain - of the galaxies classified as LINERs or composites 
by emission-line diagnostics may be actually "retired" galax- 
ies. Ionization in such galaxies would be produced by post-AGB 
stars and white dwarfs. The reader should therefore be aware 
that galaxies that w e refer to as LINERs or co mposites might 
not contain an AGN. [Cid Femandes et al. have derived a 

diagnostic that isolates this class of "retired" galaxies. This di- 
agnostic is based on Ha and [Nll]A6583 emission lines. It is, 
however, beyond the scope of this paper to derive a similar diag- 
nostic that can be used on higher redshift spectra, but it may be 
the goal of a future work. 

This paper is organized as follows. We first present the data 
and how we selected them (Sect. [2]), then we summarize of ex- 
isting classification schemes (Sect.©. In Sect.[4l we discuss the 
limits of the LIO and DEW diagnostics. Finally we present our 
supplementary diagnostic in Sect. [5] 

2. Data selection 

We used a sample of 868 492 ga la xies from the SDSS 
(Data Release 7, lAbazajian et aP [20091 available at: 
^ttp : / / www . mpa- gar ching . mpg . de/ SDSS/DR7/) with 
redshifts between 0.01 and 0.3. Actually the sample originally 
contained measurements for 927 552 different galaxies, but 
there are 109 219 duplicate spectra (twice or more), so we 
averaged these duplicated measurements in order to increase 



the signal-to-noise ratio, and filtered out those that do not 
increase the averaged signal-to-noise ratio. Among others, these 
data contain measurements of the equivalent widths of the 
following emission lines: [Oiii]A5007, [Oii]AA3726+3729, 
[Nii]A6583, [Sii]AA6717+6731, Hj3, Ha, and [NeIII]A3969. 
B aimer emission-line measurements were automatically cor- 
rected for any underlying absorption. The spectral coverage of 
SDSS is 3800-9200 A, and the mean resolution of the spectra 
1800 < A/AA 5, 2200. We also retrieved the value of the 
Dn{4000) index ('Balo gh et al.|[l999[) . which were measured on 
emission-line subtracted spectra. 

We filtered our data for a specific signal-to-noise ratio 
(in equivalent width) greater than five in order to keep the 
same selection as in Paper I, also eliminating data with pos- 
itive equivalent width, which would involve absorption lines. 
We did not apply this selection to the [NeIII]A3969, which 
is only used as an optional measurement in the DEW diag- 
nostic. This finally leads to 89 379 galaxies. All classifica- 
tions and plots presented in this paper were processed by the 
"JClassif" software, part of the "Galaxie" pipeline available at: 
http : //www. ast . obs-mip.fr/galaxie/, 

Throughout this paper, as in Paper I, all emission line ra- 
tios are equivalent width ratios rather than flux ratios. This is 
done to eliminate any dependence that may exist (mainly for 
the [Oll]/Hj3 emission line ratio) between the derived diagnos- 
tic and the dust properties of the sample. Indeed, equivalent 
width ratios are sensitive not to dust attenuation, but only to the 
ratio between continuum fluxes below each lines. Considering 
[On] and Hj3, this parameter should not evolve strongly be- 
tween galaxies with similar properties in the diagnostic dia- 
grams, even if they are at different re dshifts, keeping the con- 
sistency of the diagnostic s (see also [Lamareille et al.[ [2006b[: 
[Perez-Montero et al 
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3. Existing classification schemes 

3.1. K06 diagnostic 

As i n Paper I, we use the a simplified version of the diagnostic 
from i Kewley et al.[ ([20061 hereafter K06) as the reference clas- 
sification. In the two main K06 diagrams, we use the following 
demarcation lines: 

log([Oiii]/Hj3) = 0.61/[log([Nii]/Ha) -0.05] + 1.30, (1) 
where AGNs are above this curve, and 

log([Oiii]/Hj3) = 0.61/[log([Nii]/Ha) - 0.47] + 1.19, (2) 

with SFGs below this curve. Composites fall between these two 
curves. Moreover, AGNs can be subclassified into Seyfert 2 and 
LINERs using the line 



log ( [Oiii] /Hj3 ) = 1 .89 X log ( [Sii] /Ha) + 0.76. 



(3) 



Seyferts 2 are above this line, LINERs are below. 

Our K06 diagnostic is simplified since it does not use the last 
log([Oiii]A5007/Hj3) vs. log([Oi]A6300/Ha) diagram. This is 
a reasonable approximation since the [Ol] A6300 emission line is 
weaker than the others, hence not detected in most intermediate- 
and high-redshift spectra where the signal-to-noise ratio is typi- 
cally low. 



2 



J. Marocco et aL: Spectral classification of emission-line galaxies from the Sloan Digital Sky Survey 



3.2. LIO diagnostic 

The LIO diagnostic has been defined in Paper I. We summarize 
here the main equations, but we refer the reader to Paper I for 
details. The first equation separates SFGs from AGNs: 

log ( [Oiii] /Hj3 ) = 0. 1 1 / [log ( [On] /Hj3 ) - 0. 92] + 0. 85 , (4) 

where AGNs are above this curve. The second equation sepa- 
rates Seyfert 2 from LINERs in the AGN region: 

log([Oiii]/Hj3) = 0.95 X log([Oii]/Hj3) - 0.40. (5) 

Seyferts 2 are above this line. Then, we define a region where 
some Seyfert 2 (26% of them) are mixed with a majority of 
SFGs (21.5% contamination by Seyfert 2). This region, called 
"SFG/Sy2", is located below Eq.|4land above the line 

log([Oiii]/Hj3) = 0.30. (6) 

Finally, we define the region where most of the composites fall 
(64% of them), even if this region is dominated by SFGs (79%) 
and also contains some LINERs (2%). This region, called "SFG- 
LIN/comp", can be located by the two following inequalities: 

log([Oiii]/Hj3) < -(x-l)^-0.1x + 0.25, (7) 

log([Oiii]/Hj3) > (x- 0.2)2 -0.60, 

with X = log([Oll]/Hj3). Unlike in Paper I, we now divide 
the "SFG-LIN/comp" region for clarity into "SFG/comp" and 
"LIN/comp" regions, and the separation between SFG and 
LINERs is done according to Eq. IH 

3.3. DEW diagnostic 

The D EW diagnostic has been proposed by IStasinska et al.l 
(^2006") and involves the DEW diagnostic diagram, showing the 
Z)„(4000) index vs. the maximum (in absolute value) of the 
equivalent widths of [Oil] and [Nelll] emission lines. We sep- 
arate AGNs from SFGs using 

Z)„(4000) = -0.15y+L7, (9) 

with y = log (max (EW [Oil] , EW [Neiii] ) ) + 1 , where AGNs are 
above this line. This diagnostic is based the Z)„(4000) index be- 
ing an indicator of the mean age of the stellar populations. Thus, 
it is indeed useful to separate galaxies that are dominated by 
older stars (AGNs) from galaxies dominated by younger stars 
(SFGs). The DEW diagnostic also considers that the [Nelll] 
emission line may be stronger than the [Oil] emission line in 
AGNs. Thus, it should be used as a good additional tracer for 
AGNs in low signal-to-noise ratio surveys. 

3.4. Summary 

Figure [T] shows how the different types of galaxies (according 
to K06) appear in the high-redshift diagrams (top panels) and 
how the high redshift classifications appear back in one of the 
K06 diagnostic diagrams (bottom panels). In all panels, SFG are 
plotted in blue, Seyfert 2 in green (except in the bottom-right 
panel where green points stand for all types of AGNs), LINERs 
in cyan, and composites in magenta. The LIO diagnostic (left 
panels) implies several regions where different types of galaxies 
get mixed. Seyfert 2 region and LINERs region are now quite 
well defined, but we see composites falling in the SFGs and 



Table 1. Success chart of the LIO diagnostic, where numbers are 
the probability that a given reference K06 type goes in a given 
LIO type. 



reference K06 classification 


LIO classification 


SFG 


Composites 


Seyfert 2 


LINERs 


total 


100 


100 


100 


100 


Seyfert 2 


0.08 


0.19 


59.27 


3.81 


SFG/Sy2 


3.94 


1.13 


26.25 


0.12 


SFG 


40.24 


27.79 


5.90 


2.44 


SFG/comp 


55.47 


55.15 


0.14 


2.93 


total SFG"" 


99.65 


84.07 


32.28 


5.50 


LINERs 


0.17 


6.94 


8.00 


73.51 


LIN/comp 


0.11 


8.81 


0.44 


17.18 


total LIN** 


0.27 


15.74 


8.44 


90.70 



* union of SFG/Sy2, SFG, and SFG/comp regions. 
** union of the LINERs and LIN/comp regions. 



LINERs regions. Most of the composites fall in the region of the 
LIO diagnostic called SF-LIN/comp(marked by the dashed con- 
tour corresponding to Eqs.|7]and[8]l. SFGs and Seyfert 2 are now 
separated quite well, but still there is a small region of the LIO 
diagnostic, called SF/Sy2, where they get mixed. In the bottom- 
left panel, it seems that most of the SF/Sy2 galaxies belong to the 
K06 SFG region, and that a large number of SFG/comp galaxies 
belong to K06 SFG region. LIN/comp galaxies seem to appear 
half/half in the K06 composites and LINERs regions. 

We now the compare K06 and DEW classifications (right 
panels). We see that all K06 LINERs are correctly classified as 
AGNs in the DEW diagnostic. Most of K06 Seyfert 2 galax- 
ies lie in the DEW AGN region as well, so that is quite satis- 
fying. However composites are shared in DEW SFG and AGN 
regions, which confirms that composites are sort of hybrids be- 
tween AGNs and SFGs, also in terms of stellar populations. Thus 
they obviously cannot be isolated in the DEW diagnostic. We 
emphasize that the de finition of SFG and AGN galaxies used in 
IStasinska et al.l (12006*) is slightly different from the K06 scheme: 
it is based on the long-dashed curve shown in the bottom-right 
panel of Fig. [T](Eq. 11 of their paper). The DEW diagnostic is 
designed to exclude only pure SFGs without an y AGN contri- 
bution, while according to IStasinska et al.l (l2006h galaxies clas- 
sified as SFG by K06 (and by us) would allow up to 3% AGN 
contribution. Composites would allow up to 20% AGN contri- 
bution. 

Indeed in the bottom-right panel, a non-negligible number 
of DEW AGNs actually belong to the K06 SFG or composite 
regions. However, we also note conversely that a non-negligible 
number of DEW SFGs contaminate the K06 composite and 
AGN regions. The DEW diagnostic actually fails to completely 
exclude all pure SFGs. 

4. Limits of the LIO and DEW classifications 

4.1. Success and contamination charts 

The success chart consists in classifying galaxies from our sam- 
ple according to the reference, then associating a probability for 
each type of galaxy (AGN, composite, or SFG) to be classified 
correctly in the new diagnostic. The contamination chart is based 
on the same principle as the success chart, except this time we 
classify galaxies according to the new diagnostic, and then we 
calculate the probability that the galaxies classified as one type 
are actually of that same type according to the reference. Table [T] 
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Figure 1. Summ ary of existing schemes for the classification of emission-line galaxies at high redshift. Top: results of the refer- 
ence diagnostic (iKewlev et all 2006) are shown in the log([Oiii]A5007/Hj3) vs. the log([Oii]AA3726 + 3729/Hj3) diagram (left) 
and in the Z)„(4000) vs. max(EW[Oll],EW[Nelll]) diagram (right). The code is blue: SFG; green: Seyfert 2; cyan: LINERs; ma- 
genta: composites. Bottom: The results of the high-re dshift classificat ions are shown in the standard log([Qlll]A5Q07/ Hi3) vs. 
log([Nii]A6583/Ha) diagram. Left: the LIO diagnostic (Lamareille 2010). Right: the DEW diagnostic ( Stasi6ska etaDl2Q0 6). Same 
color code as above except green: stands for all AGNs in the bottom-right panel; light blue: SFG/co mp; yellow: LIN/comp ; orange: 
SF/Sy2.In bottom-right panel, the long dashed line is the boundary between SFG and AGN used by lStasinska et al.l (l2006h . 



Table 2. Contamination chart for the LIO diagnostic, where 
numbers are the probability that a given LIO type actually is any 
of the reference K06 types. 



reference K06 classification 



LIO classification total SFG Comp. Seyfert 2 LINERs 



Seyfert 2 


100 


2.73 


1.29 


86.71 


9.28 


SFG/Sy2 


100 


74.05 


4.30 


21.48 


0.17 


SFG 


100 


86.90 


12.17 


0.55 


0.38 


SFG/comp 


100 


82.96 


16.72 


0.01 


0.32 


total SFG"" 


100 


84.10 


14.38 


1.19 


0.34 


LINERs 


100 


2.28 


19.40 


4.80 


73.51 


LIN/comp 


100 


3.37 


56.57 


0.61 


39.46 


total LIN'''' 


100 


2.61 


30.68 


3.53 


63.18 



* union of SFG/Sy2, SFG, and SFG/comp regions. 
** union of the LINERs and LIN/comp regions. 

shows the success chart of the LIO diagnostic. It reveals a rela- 
tively satisfying spread of composite galaxies and AGNs inside 
the different types defined. Table[2]shows the associated contam- 
ination chart. We notice quite good efficiency, i.e. low contam- 



ination by other types, in the LIO SFG, Seyfert 2, and LINER 
regions. 

If we take a look at AGNs, we notice that almost 60% of K06 
Seyfert 2 galaxies are successfully classified as LIO Seyfert 2. 
Moreover 26% belong to the LIO SFG/Sy2 region, which would 
give us a total of more than 85% of K06 Seyfert 2 galaxies be- 
ing classified as Seyfert2 with the LIO diagnostic. However, the 
contamination chart shows that the LIO SFG/Sy2 region is actu- 
ally made up of only 21% K06 Seyfert 2, which means it cannot 
be used to reliably look for additional Seyfert 2 galaxies in high- 
redshift samples. The Seyfert 2 region itself shows a very low 
contamination (13%) by other types. 

Most K06 LINERs (74%) are also successfully classified as 
LIO LINERs, and 17% belong to the LIO LIN/comp region. That 
gives a global success rate of 91% for LINERs. As already stated 
in Paper I, these results are much better than results produced 
by the former L04 diagnostic; however, the contamination by 
composites in the LIN/comp region is not negligible. Only 40% 
of the LIO LIN/comp objects actually are K06 LINERs, and 57% 
are K06 composites. That gives a global 37% contamination in 
the union of the LIO LINERs and LIN/comp regions. 
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Table 3. Success chart for DEW diagnostic 



reference K06 


DEW 


SFG 


Composites Seyfert 2 


LINERS 


total 


100 


100 100 


100 


SFG 


94.95 


38.08 20.24 


2.16 


AGN 


5.05 


61.92 79.76 


97.84 



Table 4. Contamination chart for DEW diagnostic 



reference K06 


DEW 


total 


SFG 


Composites Seyfert 2 


LINERS 


SFG 


100 


91.56 


7.44 0.85 


0.15 


AGN 


100 


17.94 


44.57 12.32 


25.17 



Finally we confirm the conclusion of Paper I from these 
two tables, which is that the LIO diagnostic is very efficient 
for SFGs. If we consider the union of the LIO SFG, SFG/Sy2, 
and SFG/comp regions, the success rate is 99.7% and the con- 
tamination by other types only 16%. This low contamination by 
composite galaxies and AGNs, in particular in the SFG/Sy2 and 
SFG/comp regions, has be en show n to not critically bias SFG 
studies such as metallicity. lLamareille et al .1 (2009), for instance, 
performed such tests using the L04 diagnostic, i.e. with an even 
more contamination by AGNs in the SFG region. 

Tables [3] and |4] show the success and contamination charts of 
the DEW diagnostic. Again, the results are very good for SFGs 
with a 95% success rate, and only an 8% contamination, less 
than for LIO. Still, this better contamination chart for SFGs does 
not drastically reflect a worse success rate for AGNs. The suc- 
cess rate is indeed 80% for Seyfert 2 and 98% for LINERs. The 
DEW has in fact greater ability to separate SFGs from AGNs 
than standard diagnostic diagrams as in K06 or LIO classifi- 
cations. However, the main limitations of the DEW diagnos- 
tic clearly appear from the contamination chart regarding DEW 
AGNs. The DEW AGN region is actually made up of only 37% 
K06 AGNs. There is indeed a high contamination by 18% K06 
SFGs, much higher than with the LIO diagnostic (less than 3% 
in the Seyfert 2 and LINERs regions). Moreover, the DEW AGN 
region is contaminated by 45% K06 composites, to be compared 
to 30% for the LIO LINERs, and only 1% for the LIO Seyfert 
2. As one can see in Fig. [H composites get completely con- 
fused with Seyfert 2 and LINERs in the DEW diagnostic, while 
they are rather confused with SFGs in the LIO diagnostic. We 
do note that this co ntamination is explained mainly by the fact 
that lStasinska et al.l (|2006) use a different definition of SFGs and 
AGN galaxies (as discussed in Sect. [331 above). Indeed, in DEW 
diagnostic's philosophy, it should not be considered as a "con- 
tamination" but as a "contribution" of an AGN to star-forming 
galaxies. Finally, the DEW diagnostic does not allow any dis- 
tinction between Seyfert 2 and LINERs. 

Regarding the classification of SFGs, we conclude that one 
should use the LIO diagnostic for its very high success rate and 
DEW diagnostic for its lower contamination. About AGNs, both 
advantages of the LIO and DEW diagnostic diagrams may be 
put together to provide a better diagnostic, which is the goal of 
the present paper. We emphasize that we did not use the DEW 
diagnostic itself for our own classification. We only used the 
DEW diagram to derive a new diagnostic where it is needed (see 
Sect.gSlbelow). 



4.2. AGN counts 

In order to explore the limitations of LIO and DEW classifi- 
cations of AGNs better, we now count the number of AGNs 
(Seyfert 2 and LINERs) as a function of the ionization state, 
roughly given by the log ([Olll]/Hj3) emission-line ratio. To 
achieve this test, we divide the K06 or LIO diagnostic diagrams 
in equal horizontal slices, and then in each slice we count the 
number of AGNs. Figure |2] shows the absolute and difference 
counts (relative to K06) obtained with, from left to right, the 
following classifications: L04, DEW, LIO, and the present pa- 
per's diagnostic (see Sect. [5] below). In each panel, the results 
are compared to the actual count of AGNs according to the ref- 
erence K06 diagnostic. 

We confirm, as stated in Paper I, that the L04 diagnostic 
tends to underestimate the amount of AGNs, even when in- 
cluding L04 candidate AGNs, and that a very high number of 
AGNs (mainly LINERs) are lost in this diagnostic. However, 
we can put this effect into context thanks to Fig. O as we see 
that it only becomes significant for log([Olll]/Hj3) < 0.9, or 
log ( [Oiii] /Hj3 )< 0.7 if we include candidate AGNs. For AGNs 
with a high ionization state, the L04 diagnostic indeed gives per- 
fect results. 

Figure [2] also shows that DEW and LIO classifications are 
doing quite well by following the K06's curve almost exactly 
for log ([Oiii]/Hj3) > 0.25. In both cases, we notice an underes- 
timate of the number of AGNs for 0.25 < log ( [Oiii] /Hj3 ) < 0.7, 
where this effect is more significant for the LIO diagnostic 
than for the DEW diagnostic. Nevertheless, the DEW diagnos- 
tic clearly overestimates the number of AGNs in low ionization 
states, i.e. mainly LINERs (log([Olll]/Hj3) < 0.25). In this re- 
gion, the LIO diagnostic, in contrast, satisfyingly follows the 
K06's curve, with a small underestimate. 

Unfortunately, including the SFG/Sy2 and LIN/comp re- 
gions does not help. It makes a peak of galaxies at 
log ([Oiii]/Hj3) '2^ 0.4 appear that does not fit the reference pro- 
file, while in a lower ionization state the number of AGNs is now 
clearly overestimated. Those two effects are from the high con- 
tamination of the LIO SFG/Sy2 region by K06 SFGs and of the 
LIO LIN/comp region by K06 composites. 



5. The supplementary Mil diagnostic 

Figure [3] shows four regions where galaxies of different types 
(according to the reference K06diagnostic) are confused in the 
log([Oiii]A5007/Hj3) vs. log([Oii]AA3726 + 3729/Hj3) dia- 
gram. It shows also in its center and right panels how these 
galaxies behave in the Z)„(4000) vs. max(EW[Oii],EW[Neiii]) 
diagram. From top to bottom, the four studied regions are 
SFG/Sy2, SFG/comp, another SFG/comp region not defined in 
the LIO diagnostic but where a non negligible number of the 
composites (25%) are still mixed with SFGs, and LIN/comp. 

5.1. The SFG/Sy2 region 

In the LIO SFG/Sy2 region (see Fig. Stop), K06 Seyfert 2 and 
SFGs are confused. It is unfortunately obvious in the bottom- 
left panel that the DEW diagnostic does not separate the two 
classes of objects correctly in the LIO SFG/Sy2 region. We thus 
propose a new demarcation line to separate K06 Seyfert 2 from 
SFGs, valid only in the LIO SFG/Sy2 region, with the equation 

Dn{4000) = 0.22 X log(max(EW[Oii],EW[Neiii])) +0.97.(10) 
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Figure 2. Comparison between the number of AGNs (Seyfert 2 and LINERs) counted with different calibrations at high redshift, as 
a function of log([Oiii]A5007/Hj3). From left to right, the calibrations used are L04, DEW, LIO (paper I), and the present paper. 
Top panels show absolute counts. Bottom panels show difference counts. In each panel, the reference counts established with the 
K06 diagnostic are shown in red, and the counts obtained with the high-redshift diagnostic are shown in black. The gray dashed 
lines show the results when including "candidate" regions, i.e. for L04, LIO, and the present paper's classifications: candidate AGNs 
region; SFG/Sy2 and LIN/comp regions; LIN/comp region. 



Table 5. Success chart for the supplementary Mil diagnostic in 
the LIO SFG/Sy2 region. 



reference K06 


Mil 


SFG Seyfert 2 


total 


100 100 


SFG 


99.10 3.10 


Seyfert 2 


0.90 96.90 



Seyfert 2 would fall above this line, SFG below. The slope and 
zero point of this line have been optimized by minimizing on a 
grid the following function: 

^2^(^l-S^f + {\- Ssf + {C^f + {Cb?, (11) 

where S^,Sb,C^, Cb are the success rate for AGN above the de- 
fined line, the success rate for SFG below the line, the contam- 
ination by SFG above the line, and the contamination by AGN 
below the line (all values between and 1), respectively. Indeed, 
we want to maximize the success rates and minimize the con- 
tamination at the same time above and below the defined line. 
The grid is done in O.Oldex steps in both slope and zero point. 
To minimize computer time, limits on this grid are defined by 
eye. 

We have established the success of our new diagnostic in the 
LIO SFG/Sy2 region (see Table 0. This chart shows that our de- 
marcation line works almost perfectly and can be used in that 
area. We have now correctly classified almost all actual Seyfert 
2 in our sample: 97% of those in the LIO SFG/Sy2 region are 
classified as Seyfert 2. Given that 59% of the K06 Seyfert 2 in 
the whole sample were already correctly classified, and 26% of 
them classified as LIO SFG/Sy2, this increases the global suc- 
cess rate to 85%. This is the best success rate one can obtain by 
combining LIO and DEW diagrams. The contamination in the 



SFG/Sy2 region is made of 3.1% SFGs above the line defined 
by Eq. [inland 0.9% Seyfert 2 below it. 

5.2. The SFG/comp region 

The LIO SF/comp (see Fig. [3] second line) contain composites, 
SFGs, and very few LINERs. Following the optimization pro- 
cedure explained above, we find the following equation which 
separate as many SFGs as possible from composites: 

Dn (4000) = -0. 1 1 X log (max (EW[Oii] , EW[Neiii] )) + 1 .4, (12) 

where SFGs are below this line. Since SFGs dominate the sam- 
ple, the region below this line is composed of 98% SFGs. 
Conversely, the region above this line is still a mix between 
SFGs (59%), composites (36%), and a few LINERs (4%). We 
again applied the optimization procedure but now only consider 
the latest region, in order to isolate pure composites as much as 
possible. We obtain: 

Dn (4000) = - 0. 1 7 X log (max (EW [On] , EW [Neiii] )) + 1.75,(13) 

where composites are above the line, SFG/comp below it (i.e. 
between the two lines). 

Tables [6] and [T^how the success chart and the contamina- 
tion chart of the supplementary Mil diagnostic, in the LIO 
SFG/comp region. From all the K06 SFGs present in this re- 
gion, 64% are now correctly classified as SFGs, another 36% be- 
ing still ambiguously classified as SFG/comp. is unfortunately 
impossible to increase this success rate without misclassifying 
too many K06 composites as SFGs. Only 14% of K06 com- 
posites could be isolated. Newly isolated SFGs are not signif- 
icantly contaminated by K06 composites (2%), and neither are 
the newly isolated composites by K06 SFGs (6%). The majority 
of the K06 of composites (81%) are still ambiguously classified 
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Figure3. Left: In the log([Oiii]A5007/Hj3) vs. log([Oii]AA3726 + 3729/Hj3) diagram, the different regions where galaxies of 
different types are mixed (color-code according to the reference K06 diagnostic). The curves are the ones from the Lamareilld (l20 1 Oh 
diagnostic. Center and right: The same points as in the associated left panels, but now in the Dn (4000) vs. max (EW[Oll] , EW[Nelll]) 
diagram. For clarity, we plot the different types in two panels.The solid li nes show the adopted demarcation lines (see text for details). 
The dashed line in top-right panels is the separation adopted by Stasinska et al.l (12006) . 
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Table 6. Success chart for the supplementary Mil diagnostic in 
the LIO SFG/comp region. 



reference K06 


Mil 


SFG 


LINERS 


composites 


total 


100 


100 


100 


SFG 


63.69 


3A1 


5.31 


composites 


0.19 


70.14 


13.95 


SFG/comp 


36.12 


26.39 


80.74 



Table 7. Contamination chart for the supplementary Mil diag- 
nostic in the LIO SFG/comp region. 



reference K06 


Mil 


total 


SFG LINERS 


composites 


SFG 


100 


98.30 0.02 


1.67 


composites 


100 


5.85 8.21 


85.93 


SFG/comp 


100 


68.81 0.19 


31.00 



Table 8. Success chart for the supplementary Mil diagnostic in 
the additional region of the LIO diagnostic where SFG are mixed 
with composites. 



reference K06 


Mil 


SFG composites 


total 


100 100 


SFG 


92.10 12.69 


composites 


2.90 61.97 


SFG/comp 


5.00 25.34 



as SFG/comp. The best reason to use this diagram is clearly for 
isolating SFGs. The SFG/comp region is now made of 69% K06 
SFGs and 31% K06 composites, which is more balanced than in 
Paper I (respectively 83% and 17%). 

5.3. Additional region of composites mixed with SFGs 

We define a last region where K06 composites are mixed with 
K06 SFGs in the LIO diagnostic. It is located below the SFG/Sy2 
region, and above log([Oiii]A5007/Hj8) > -0.4, excluding the 
SFG/comp region (see Fig.[3]third line). Even though we can see 
that these K06 SFGs are mostly spread over the bottom right of 
the DEW diagnostic diagram and K06 composites are slightly 
above, there is still a small area where they get together. 

Following the same optimization procedure as above, we 
first fit the optimized demarcation line between pure SFGs and 
SFGs mixed with composites: 

Z)„(4000) = 0.44 X log(max(EW[Oii],EW[Neiii])) +0.72,(14) 

where SFGs are below this line. As in previous section, the re- 
gion above this line contains a mix of SFGs (25%) and compos- 
ites (75%). In this region, we fit another optimized demarcation 
line between pure composites and composites mixed with SFGs: 

Z)„(4000) = -0.37 xlog(max(EW[Oii],EW[Neiii])) + 1.66, (15) 

where composites are above this line, and SFG/comp below it 
(i.e. between the two lines). We finally add the remaining mixed 
galaxies to the SFG/comp type. 

Tables [6l and [9] show the success and contamination charts of 
the supplementary Mil diagnostic in this last region. Of all K06 
SFGs present in this region, 92% are now correctly classified, 
and another 5% are ambiguously classified as SFG/comp. We 



Table 9. Contamination chart for the supplementary Mil diag- 
nostic in the additional region of the LIO diagnostic where SFG 
are mixed with composites. 



reference K06 
Mil total SFG composites 
SFG Too 9639 5:61 
composites 100 14.50 85.30 
SFG/comp 100 42.08 57.92 



also managed to isolate 62% of the K06 composites falling in 
this region, which is an improvement compared to the LIO diag- 
nostic. Conversely, 12% of the K06 composites are now unfor- 
tunately misclassified as SFGs. This is still an improvement : we 
recall that 100% of the K06 composites in this region used to be 
classified as SFGs with the LIO diagnostic. Most of them (62%) 
are now classified as composite galaxies, and another 25% are 
still ambiguously classified as SFG/comp. The composites de- 
fined in this region are contaminated by 15% K06 SFGs, which 
may not be neglected. The SFG/comp galaxies defined in this re- 
gion are made of approximately half K06 SFGs (42%) and half 
K06 composites (58%). 

5.4. The LIN /com p region 

Figure [3] (bottom) shows the LIN/comp region of the LIO diag- 
nostic in the DEW diagnostic diagram. It is clear that this dia- 
gram cannot be used to isolate LINERs cleanly from composites. 
Indeed, one may argue that LINERs are more concentrated on 
the top and composites on the bottom of the diagram, so we pro- 
pose a straight line at Z)„(4000) ^ 1.75 as a separation. Using 
this line we find, however, 61% LINERs and 39% composites 
above, 28% and 72% below respectively, which is unsatisfac- 
tory. Thus, we do not update the LIN/comp region as in Paper I. 

5.5. Discussion 

Figure |4] shows our supplementary Mil diagnostic combined 
with LIO diagnostic, in one of the standard K06 diagnostic 
diagrams. The left panel looks pretty good: SFGs Seyfert 2, 
LINERs, and some composites lie almost perfectly in the cor- 
rect corresponding regions of this diagram. In contrast, the right 
hand panel shows the limitations: objects that are still ambigu- 
ously classified or as SFG/comp or as LIN/comp. Anyway, com- 
paring Fig. |4]to the bottom left hand panel of Fig. [T] we see a 
clear improvement. 

Tables [TO] and [TT] establish new contamination and success 
charts in order to get a more precise measurement of this im- 
provement. Success chart shows that 1 % of K06 SFG are clas- 
sified in a region where they are not supposed to be. K06 com- 
posites are predominantly found in the SFG/comp region (51%) 
than in the composite region (23%). This is an improvement over 
Paper I, but it shows that neither the LIO nor the DEW diagrams 
are really good at identifying composites at high redshift. K06 
Seyfert 2 galaxies and LINERs have a high success rate (respec- 
tively 85% and 74%), which is for Seyfert 2 galaxies a really 
good improvement compared to Paper I. For K06 LINERs, the 
success rate increases to 91% including the LIN/comp, but one 
has to be aware that this region is actually made of only 39% 
K06 LINERs and is dominated by 57% K06 composites. 

Nevertheless, one can conclude from the contamination chart 
that SFGs, Seyfert 2, and LINERs are not significantly contam- 
inating each other. The main contamination comes in all cases 
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Figure 4. Results of the new diagnostic derived in the present paper in the standard log([Oiii]A5007/Hj3) vs. log ([Nil] A 65 83 /Ha) 
diagram. For clarity, only SFGs, AGNs and composites are shown in the left panel, while SFG/comp and LIN/comp are shown in 
the right panel. Same color code as in Fig. [T] 



Table 10. Overall success chart for the LIO and Ml 1 diagnostics. 



reference K06 



LlO/Mll 


SFG 


Composites 


Seyfert 2 


LINERS 


total 


100 


100 


100 


100 


SPG 


77.97 


8.92 


0.92 


0.10 


SFG/comp 


20.99 


50.97 


0.47 


0.98 


composites 


0.66 


23.44 


5.46 


4.30 


Seyfert 2 


0.12 


0.93 


84.71 


3.93 


LINERS 


0.17 


6.94 


8.00 


73.51 


LIN/comp 


0.11 


8.81 


0.44 


17.18 


total SFG^ 


98.96 


59.88 


1.39 


1.08 


total LINERs^ 


0.27 


15.74 


8.44 


90.70 


total comp. ^ 


21.75 


83.22 


6.38 


22.46 



:!>ro+:^ro/com 
' LINERs+LIN/comp 
' composites+SFG/comp+LIN/comp. 



Table 11. Overall contamination chart for the LIO and Mil di- 
agnostics. 



reference K06 


LlO/Mll 


total 


SFG 


Composites 


Seyfert 2 


LINERS 


SFG 


100 


97.68 


2.26 


0.05 


0.01 


SFG/comp 


100 


66.81 


32.89 


0.07 


0.23 


composites 


100 


11.02 


79.77 


3.99 


5.23 


Seyfert 2 


100 


2.73 


4.42 


86.20 


6.66 


LINERS 


100 


2.28 


19.40 


4.80 


73.51 


LIN/comp 


100 


3.37 


56.57 


0.61 


39.46 


total SFG^ 


100 


75.83 


15.37 


3.30 


5.50 


total LINERs^ 


100 


2.61 


30.68 


3.53 


63.18 


total comp.^ 


100 


53.67 


41.63 


0.68 


4.02 



1 SFG+SFG/com 

2 LINERs+LIN/comp 

3 , 



' composites+SFG/comp+LIN/comp. 



from composites: 33% in the SFG/comp region, 19% in the 
LINERs region, and 57% in the LIN/comp region. It is con- 
versely very low in the SFG and Seyfert 2 regions. 

One could worry about aperture effects. Indeed, SDSS spec- 
tra are based on 3" fibers. This may end in overestimated 
Z)„(4000) values for close objects where only the central bulge 



is covered by the fiber. However, one can see in Fig. [3] that 
only objects with Z)„(4000) < 1.5 could change their classifi- 
cation with a significant aperture effect. Looking at Fig. 16 in 
iKauffmann et al.l J2003bl) . we see that those objects do not actu- 
ally suffer from a strong aperture effect. We conclude that our 
diagnostic is not biased by this effect. 

Finally, we invite the reader to take a look at the right hand 
panels of Fig.[2j It shows the AGN counts obtained with our new 
supplementary Mil diagnostic combined with LIO diagnostic. 
We clearly see that the diagnostic derived in the present paper is 
the one that follows the reference K06 curve more accurately. 
There are still problems for log ([Oiii]/Hj3) < 0.25, which is 
normal since we did not manage to change the classification of 
LINERs by adding the DEW diagnostic diagram. 

6. Conclusion 

By adding the Mil diagnostic to the LIO diagnostic derived in 
Paper I, we now have a very good classification of emission-line 
galaxies that can be used on high-redshifts samples. The main 
improvements compared to Paper I are 

- The unambiguous classification of objects in the former 
SFG/Sy2 region as SFGs or Seyfert 2. 

- The unambiguous classification of some of the objects in the 
SFG/comp region as SFGs or composites (where no com- 
posites at all were found in Paper I). 

- A better definition of the SFG/comp region, which leaves 
fewer possible composites not flagged as such. We empha- 
size again that this region is in any case dominated by SFGs. 

No improvements could have been done in the LIN/comp region, 
which is left unchanged compared to Paper I. 

In order to use the diagnostic derived in this paper, one 
should follow these steps. 



1. 



3. 



Classify objects in the log([Oiii]A5007/Hj3) vs. 
log([Oii]AA3726 + 3729/Hj3) with the LIO diagnostic 
derived in Paper I (see also equations in Sect. 13.21 ). 
Classify objects falling in the SFG/Sy2 region as SFGs or 
Seyfert 2 using Eq. [TOl 

Isolate objects falling the the SFG/comp region as 
(a) SFGs using Eq.[T2l and 
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(b) composites using Eq. \T3\ 
4. Define a new SFG/comp region inside the SFG region us- 
ing log ([Oiii] A5007/HJ3) > -0.4, so inside this new region, 
isolate 

(a) SFGs using Eq.fM and 

(b) composites using Eq. \T5\ 

In both points (3) and (4) above, objects not classified as SFGs 
or composites remain of the ambiguous SFG/comp type. We in- 
vite the reader to look at the "JClassif" software (available at: 
http://www.ast.obs-mip.fr/galaxie/), which performs 
these steps automatically on any sample, as well as other classi- 
fication schemes. 

Table [TT]may be used as a probability chart showing whether 
each type in our diagnostic is one of the K06 reference types. 
However, we warn the reader that the relative proportions of 
SFGs, composites and AGNs in each regions of our diagnos- 
tic diagrams may evolve with redshift compared to the SDSS 
sample. Finally, we note that it is possible to upgrade this classi- 
fication to higher redshifts where [Olll] A5007 and HjS emission 
lines get red-shifted out of the spectra (1.0 < z < 1.5 on op- 
tic al spectra). To that goal one may use the equations provided 
bv lPerez-Montero et al.](l2007l) . which convert [Neiii]A3869 and 
US to [Oiii]A5007andHj8. 
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